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Abstract 


In this paper, we present a modelling experiment developed to study systems of cities and processes 
of urbanisation in large territories over long time spans. Building on geographical theories of urban 
evolution, we rely on agent-based models to 1/formalise complementary and alternative hypotheses of 
urbanisation and 2/explore their ability to simulate observed patterns in a virtual laboratory. The paper is 
therefore divided into two sections : an overview of the mechanisms implemented to represent competing 
hypotheses used to simulate urban evolution; and an evaluation of the resulting model structures in their 
ability to simulate - efficiently and parsimoniously - a system of cities (the Former Soviet Union) over 
several periods of time (before and after the crash of the USSR). We do so using a modular framework of 
model-building and evolutionary algorithms for the calibration of several model structures. This project 
aims at tackling equifinality in systems dynamics by confronting different mechanisms with similar 
evaluation criteria. It enables the identification of the best-performing models with respect to the chosen 
criteria by scanning automatically the parameter space along with the space of model structures. 


1 . Introduction 

Simulation models of urban systems were first 
developed in the 1950s and 1960s as a way 
to understand the complexity of cities and to 
forecast trends and consequences of planning 
policies. Several formalisms (thermodynamics, 
general systems theory, synergetic, microsimu¬ 
lation) were used | Batty, 20M| , following fash¬ 
ions as well as opportunities arisen from ac¬ 


cess to new technologies [Heppenstall et al., 2012 


Pumain and Sanders, 20l^ . Each of the methods 
cited have their specific advantages and drawbacks, 
that we will not discuss here, but all of them 
provide a same opportunity and challenge that 


is linked to simulation and has not changed as 
the formalisms evolved. The opportunity that we 
focus on in this paper is the function of virtual 
laboratory that a computerised simulation model 
enables [ Carley, 1999] , which is of paramount in¬ 
terest for human and social sciences in which in 
vivo experiments are impossible or difficult. By 
allowing to implement competing and/or comple¬ 
mentary hypotheses into generative mechanism^ 
in a model testable against empirical data, this 
function of virtual laboratory makes the model a 
framework for the evaluation of the plausibility of 
different theories. However, simulation as a method 
and an epistemological way of testing theories 
gives way to a limitation known since Bertalanffy 


* Corresponding author: c.cottineau@ucl.ac.uk 

^that is, a set of interaction activities performed by entities (cities) |Hedstr5m and Ylikoski, 2010| producing emergent patterns. 
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I von Bertalanffy, 1968| as equifinality. It describes 
the fact that even when a model is performing well, 
one cannot infer that the underlying combination 
of mechanisms is the one operating "in real life", 
because several models can lead to the same results 
(many-to-one), and the same process can lead to 
several patterns (one-to-many) flO'Sullivan, 2004| . 
The problem of adequacy assessment of the model 
with real life (also known as ontological adequacy 
testing, cf. [Rossiter et ah, 2010| ) is twofold. First, 
many effective (or "real") processes can lead to 
the same observed situation within the target 
system that we aim to model. Second, inade¬ 
quate mechanisms implemented in the model can 
simulate in a satisfying way the situation under 
study. This causal challenge of identification of 
the generative mechanisms is a recurrent prob¬ 
lem in social simulation for explanatory purposes 
I Griine-Yanoff, 200^ |Elsenbroich, 2012] |, but one 
that is usually overlooked at the stage of results' 
analysis. 


We have tried in a previous study 
I Cherel et al., 20151 to tackle the one-to-many part 
of the challenge within the model, using an evo¬ 
lutionary algorithm to look for the maximum di¬ 
versity in patterns produced by a given set of 
mechanisms in a model of systems of cities. What 
we present here relies to the multiplicity of possible 
causes leading to the single historical trajectory 
observed empirically. We present a multimodelling 
framework that allows to combine different mech¬ 
anisms into a modular model evaluated against a 
unique set of evaluation criteria, enabling for the 
comparison of the performance of different model 
structures to simulate urbanisation and the evolu¬ 
tion of a system of cities. 


More precisely, we are interested in simulating 
the co-evolution of cities encompassed in a terri¬ 
torial system (typically a nation or a continent), 
and to reproduce the regular patterns or organised 
structures that are observed empirically in vari¬ 
ous systems of cities: their hierarchy, spacing and 
functional differentiation [Pumain, 1997| . Section [II| 


presents the patterns we aim to reproduce and the 
catalogue of theories and mechanisms on which we 
draw to compose the model. Section describes 
the multi-model and its implementation in our 
study case, as well as its exploration through multi¬ 
calibration. This exploration aims to explore the 
performance of different hypotheses in explaining 
the evolution of Soviet and post-Soviet cities. Sec- 
tion|IV]concludes on the study case and the method. 


II. A CATALOGUE OF POSSIBLE MECHANISMS OF 
URBANISATION 

"It may also be useful to think of com¬ 
plex geographical models as extensions of 
thought experiments, where the necessary 
and contingent implications of theories can 
be examined. Further, admitting that 'all 
models are wrong' is akin to the realisation 
in post-structural social science that multi¬ 
ple competing accoimts of the same settings 
are possible, and that faced with a diversity 
of accounts the context and intent of each 
must be an important element in the evalu¬ 
ation process. " [O'Sullivan, 20041 p.291] 

A large bunch of theories in geography, eco¬ 
nomics and natural sciences have tried to provide 
an account of the regularity of urbanisation pro¬ 
cesses and the structuration of a system of cities, 
through models and narratives. Keeping in mind 
the equifinality challenge, this means that we al¬ 
ready have a strong theoretical basis and several 
causal model candidates to confront with empirical 
regularities. Without being exhaustive on these the¬ 
ories, we try to provide an overview of the mecha¬ 
nisms that have been proposed in the literature. We 
then expose the kind of results that are achieved by 
statistical models using empirical data, and present 
the particularity of our study case. 


II.l. Competing theories 

Out of the three stylised facts describing the struc¬ 
ture of a system of city: the hierarchy of cities by 
size, their regular spacing and the functional differ¬ 
entiation [Pumain, 19971, the former has fostered 
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the larger body of research, and will be our main 
criterion for stating the performance of a simula¬ 
tion model, whereas the latter two remain hard to 
formalise and to compare over time and space with 
respect to the availability of urban information at 
a local level. Consequently, we present in larger 
details the competing theories aiming at explain¬ 
ing the regularity of city size distribution and its 
evolution over time, and quickly review theories of 
location and functional specialisation. 

The hierarchical organisation of city sizes rep¬ 
resents a "mystery" |Krugman, 1996c| that has 
intrigued many researchers, because of its reg¬ 
ularity and simplicity of description (the rank- 
size "rule") despite the complexity of urban 
functioning and interactions (for reviews, cf. 
[Cheshire, 1999[ |Pumain, 2006| ). [Auerbach, 1913 


Lotka, 1925) Singer, 1936| are known to be the 
first to formalise this regularity, leading to the 
famous rule that the size of a city is a power 
law function of its rank in the hierarchy, a rule 
particularized by [ Zipf, 194^ arguing that the 
Pareto exponent is expected to be -1. Alterna¬ 
tive mathematical descriptions have been pro¬ 
posed, for instance the lognormal distribution 
[Gibrat, 1931| Berry, 19^ [Eeckhout, 2004] . The 
two distributions however are close and tend to 
coincide in the middle-upper part of the hierarchy 
I Mitzenmacher, 2004| [Clauset et al., 2009} . 


ing to a lognormal distribution. The Simon model 
of preferential attachment [Simon, 1955} applied to 
cities generates power laws of city sizes by simu¬ 
lating an incremental creation of new urban blocks 
attached to existing urban clusters with a proba¬ 
bility dependent on their size [Krugman, 1996a |. 
Those two models are elegantly simple enough to 
generate hierarchical distributions, but they lack an 
explaining power to help understand urbanisation 
processes: "Scaling laws often are viewed as over¬ 
identified: they can be generated by a wide range 
of distinct models. It is essential to select specifica¬ 
tions that integrate in the model a significant part 
of the existing knowledge about towns and cities" 
[Pumain, 2004| p. 1]. Some attempts were made to 
characterise them in terms of exogenous shocks and 
contingent local policies (for example in Gabaix's 
model [Gabaix, 1999j ), but random growth models 
seem to miss the causal power of mechanism-based 
simulation models |Hedstr6m and Ylikoski, 2010|. 


Other accounts of urban growth models lead¬ 
ing to the observed stylized hierarchical struc¬ 
ture and evolution (deterministic in Dimou and 
Schaffar's typology [Dimou and Schaffar, 2011] , 
by opposition to random models) range into 
two broad categories: equilibrium models 

of externalities (such as [Henderson, 1974 


[Krugman, 1996b |) and evolutionary models of spa 
tial interactions (such as [Allen and Sanglier, 1981 


Because of this regularity, generative mod¬ 
els of power laws and lognormal distributions 
have logically been considered as candidate expla¬ 
nations [Mitzenmacher, 2004| [Clauset et al., 200^ . 
The most famous one is Gibrat's law [Gibrat, 1931 1 , 
which generates a lognormal distribution by a 
process of "proportional effect". This model of mul¬ 
tiplicative growth allocates randomly growth rate^ 
from independent distributions, at each short time 
step, to cities independently from their size. It is 
thus considered close to a random walk. It results 
in amplifications of urban growth and decline lead- 

^If not stated otherwise, growth rates in this paper refer to the 
population at the beginning of the period. 


Weidlich and Haag, 1983 [Bura et al., 1996| ). The 
former type of models formalizes a balance of cen¬ 
tripetal (sharing, matching, learning in the labour 
market for example) and centrifugal forces (pollu¬ 
tion and congestion) resulting in optimal sizes for 
cities given the current technology. The latter type 
of models rely on attractivity and spatial interac¬ 
tions of cities to explain the dynamic of competition 
and cooperation resulting in a regular hierarchy in 
the urban system. 

Location theories of cities typically build on two 
concepts to explain the regular spacing and empir- 

ratios of population variation during a time period by the total 
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ical distribution of cities in space: site and/or sit¬ 
uation advantages [Christaller, 19331 lUllman, 1941| 
Krugman, 1996b |Pumain, 1997} . Site advantages 
refer to natural features available at an absolute lo¬ 
cation (natural resources, harbour conditions, etc.), 
whereas situation advantages refer to centrality in 
a transportation network or an interface relative 
position for example. All the theories mentioned 
above rely necessarily on relations with (consis¬ 
tent) territories providing resources or distance 
constraints for instance, in order to explain urban 
coevolution. 


Theories of urban specialisation inherit from 
trade theories (via comparative and competitive 
advantages) and theories of product and innova¬ 
tion cycles. They state that cities have different 
features (size, situation, site, former specialisation) 
that make them akin to be more competitive to 
specialise in one economic sector in comparison 
with other cities, or to adopt an irmovation sooner 
or later than their neighbours. Depending on the 
current cycle of the product, those specialisations 
affect not only the sectoral composition of the ac¬ 
tive population and its economic output, but also 
confers an advantage to early adopters and de¬ 
fines further options for specialisation and growth 
iDuranton and Puga, 2001||Pumain et al., 2006|. 


innovation cycles through competition and cooper¬ 
ation, resulting in some complementarity of their 
specialisation. These local interactions and their 
consequences on the regular organisation of the 
system as a whole under spatial constraints could 
be thought of as "complex systems effects". 

• Size effects like agglomeration economies and 
urbanisation externalities illustrate a very direct 
and self-reinforcing cause for hierarchical differen¬ 
tiation. 

• Site effects explain the spatial location of growth 
processes around resource-rich areas for the related 
innovation cycle. 

• Situation effects illustrate the importance of 
the neighbouring relational environment (potential 
field, network accessibility, etc.) on a city's pattern 
of growth. 

• Territorial effects account for some exogenous 
(policy) shocks and the solidarity of urban tra¬ 
jectories in a common political space (through 
redistributive processes for example). 

We look for statistical evidences of these factors 
of urbanisation in the empirical literature before 
turning to our case study experiment, where we 
propose an implementation and evaluate the power 
of each of the five mechanisms into an agent-based 
(multi-) model. 


Finally, political factors and singular policies 
are usually considered necessary to explain dif¬ 
ferentiated urban growth (through administrative 
functions or investment policies targeting specific 
cities at a given moment in time). 

We compose our catalogue with mechanisms 
that formalize the theories cited above. They fall 
into five broad classes of generative mechanisms 
potentially accounting for the emergence of a struc¬ 
tured system of cities: 

• Spatial Interactions and diffusion allow for 
the exchange of information, money, goods and 
people. It thus makes cities co-evolve over time 
and adapt collectively to changing economic and 


II.2. Empirical Results from the Literature 


Spatial Interactions are tricky to measure be¬ 
cause of the variety and non-commensurability of 
flows circulating between cities at various tempo¬ 
ralities. Until recently, the diffusion of innova¬ 
tions (agricultural techniques [ Hagerstrand, 196^ , 
telephone lines [Robson, 1973} or newspapers 
[Fred, 1973|| ) served as a proxy for these interac¬ 
tions. Since the development of various volumes of 
high velocity data, actual interactions (like phone 
calls I Krings et al., 200^ ) have confirmed for exam¬ 
ple the relevance of the gravity model to describe 
inter-city interactions. 


Size effects in urban growth and differentia¬ 
tion were revealed by a persistent empirical cor- 
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relation between growth rates and city sizes over 
long periods of time. All over the 19th century, 
[Robson, 1973| p. 79] measured a positive co¬ 
efficient between the log of English and Welsh 
cities' population and their ten-year growth rates 
(from a minimum of -1-1.47 between 1861 and 1871 
to a maximum of -1-8.53 between 1821 and 1831). 
This correlation is found for French cities as well 
I Guerin-Pace, 1995| . The size effect finally relates 
to the stability of the rank position of large cities (by 
comparison with the fluctuations of smaller cities). 


Site effects were classically approached by 
estimating the surplus of growth associated with 
a localised resource (typically, deposits of natural 
materials, such as coal or gas). In the Soviet urban 
system of the 1920s-1930s, the location of a city 
on a coal deposit was associated with a surplus of 
1.15 points in percentage of average demographic 
growth per armum, everything else being equal. A 
surplus over 0.5 point is observed nowadays (1989- 
2010) for oil and gas deposits [Cottineau, 2014a| . 
In the USA, [Black and H^enderson, 200^ estimate 
that the coastal location (e.g. a resource for tourism) 
is associated with a significantly higher ten-year 
growth rates of 3 to 5 points. 


Territorial effects can be approached empir¬ 
ically by relating political statuses to dynamics of 
growth. In developing countries, regional capitals 
were found to grow significantly faster by 0.5 to 1 
point of annual average growth rate in the 1960s 
[Preston, 1979| and the 1990s [Brockerhoff, 199^ . 
In the Former Soviet Union, the regional status 
of capital has proven important to predict urban 
growth [Harris, 1970[ , the coefficient regressed 
against growth rates over time ranges from -1-0.24 
point between 1989 and 2002 to -1-1.88 between 1926 
and 1939 [Cottineau, 2014a| . Besides, cities that be¬ 
long to the same territory have shown an increased 
pattern of synchronicity in their growth and decline 
trajectories from the 1980s on, suggesting evidence 
of both political shocks and territorial solidarity in 
the spatial distribution of urban growth. 


Despite the corroboration of theories provided 
by the numbers cited above, statistical correlations 
do not prove any causal chain and fail to explain the 
processes at work. Moreover, they are not adapted 
to model dynamic, non-linear and complex evolu¬ 
tions and interactions. This is a recurrent problem 
that has led social scientists to promote a combina¬ 
tion of statistical methods with generative (simula¬ 
tion) modelling [Byrne, 1998[ [Goldthorpe, 2001) . 


Situation effects can be revealed by the spa¬ 
tial autocorrelation of growth or the coevolution 
between transportation networks and urban net¬ 
works. In the first case, [Hernando et al., 20151 


found a characteristic distance for spatial autocorre¬ 
lation of growth rates of 215 km for American coun¬ 
ties and of 80 km for Spanish cities. As for trans¬ 
portation dynamics, j Bretagnolle, 2003^ measured 
the correlation between accessibility and growth 
rates for French cities in the last two centuries. She 
finds that cities that were weakly cormected (by any 
transportation network: road, rail or air) in 1900 
and stayed isolated in 2002 grew slower (0.94% on 
average per annum between 1900 and 2002) than 
cities that became motorway nodes (1.21%) or mul¬ 
timodal hubs (1.69%). Fikewise, well connected 
cities at the begirming of the period tended to grow 
faster than the first category. 


II.3. A Case Study : the Former Soviet Union 

We applied such a mixed methodology to the 
case study of the urban Soviet Union of the last 


50 years [Gottineau, 2014at [Cottineau et al., 2015[ . 
This system is mainly interesting to us because 
it has the reputation of having been a controlled 
economic, political and social system aiming at 
reaching equalisation (of city size for example) and 
yet, while observing its evolution with generic ur¬ 
ban models, the urban trends appear very classic. 
By this we mean for instance that the evolution 
of the percentage of urban population follows a 
very classical logistic function of urban transition, 
that the rank-size slope is close to the value ex¬ 
pected with respect to the time of settlement of 
the different parts of the territory, and that it has 
increased over time, indicating an increase of in- 
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equality of city sizes (despite anti-urban political 
discourses flCottineau, 2014a| ). The monographic 
explanation of urban growth in the Former Soviet 
Union has been dominant in urban geography, and 
we argue that it might have hidden very generic 
processes of urbanisation in this system of cities 
ICottineau and Slepukhina, forthcoming |. 


Our modelling experiment aims at testing the 
genericity of Soviet and post-Soviet urbanisation by 
simulating generic mechanisms and testing them 
against harmonised historical data. The database 
used in this study consists of open demographic, 
territorial and site informations on 1929 agglomera¬ 
tions of the Former Soviet Union between 1840 and 


2010 ICottineau, 2014b| . 


We analyse and compare the explaining power 
of generic mechanisms in a simulation of the period 
before (1959-1989) and the period after (1989-2010) 
the dislocation of the Soviet Union. The agent- 
based model framework of this experiment is called 
MARIU^ In this model, cities interact as collec¬ 
tive agents |Bura et al., 1996| and we evaluate every 
simulations with three evaluation criteria. The first 
two criteria are boolean indicators of the "realism" 
of the microdynamics of the simulation : we only 
analyse simulations in which the number of cities 
with a nil wealth and the number of cities pro¬ 
ducing more output in one year than their stock 
of wealth is 0. Once these controlling criteria are 
met, we try to minimise the distance 5 between the 
simulated population Pg and the observed popula¬ 
tion Po of each city i at each date t we have census 
information for, as stated in equation 0. 

t i 

This criterion ensures that a "perfect" model 
(achieved for = 0) would predict the right amount 
of urban growth and its exact location over time. 
In order to compare different time periods, we nor¬ 
malise 3 by the number of cities and the number of 
censuses used in a given period. 


ITT. Modular multimodelling experiment 


The incentive to implement competing and comple¬ 
mentary theories into different models evaluated 
against one another is a recurrent plea in the simu- 


lation literature | 

Openshaw, 1983 

lO'Sullivan, 2004 

Batty and Torrens, 2005 

IThiele and Grimm, 20151 

Batty, 20151. It reveals how tricky its implementa- 


tion and automatic evaluation might be, besides 
the epistemological challenge of equifinality and 
the kind of conclusions one can draw from this 
confrontation. Indeed, thirty years after the "Auto¬ 
mated Modeling System to Explore a Universe of 
Spatial Interaction Models" by [ Openshaw, 1983) , 
there are no standard tools nor formal methodology 
for theory testing with simulation models. Indeed, 
Openshaw's automated way to explore model struc¬ 
tures, being a pure optimisation way of discovering 
model structures, is impressive methodologically 
but it does not suit our goal of theory testing in a 
virtual laboratory, because it can result in optimal 
models that are impossible to interpret. 


Instead, we think that the first step should 
be to gather a catalogue of theoretical pro¬ 
cesses and mechanistic hypotheses working as 
potential explanations. This usually precedes 
the mixed-modelling step [Contractor et al., 2000t 
Auchincloss et al., 2008| and prevents from 
endlessly building models "from scratch" 
[Thiele and Grimm, 2015) ; it allows to build on 
previous work and experience. 


In a second step, we have built a baseline model 
of urban interactions and implemented h 5 q)otheses 
as blocks of mechanisms that can be activated or 
discarded to compose a family of simulation mod¬ 
els (section |III.1[ |. By automatically calibrating all 
model structures against a similar evaluation ob¬ 
jective (section [Ill.2| |, we were able to compare the 
power of several groups of hypotheses in the case 
of the Soviet and post-Soviet urbanisation (section 

11131 . 


^Models of Agglomerations in the perimeter of Imperial Russia and the former Soviet Union. For further details, see 
ICottineau et al., 2015| 
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III.l. Implementing mechanisms as building ters (economicMultiplier, sizeEffectOnSupply and 

blocks sizeEffectOnDemand). 


The multi-model is composed of the baseline model 
and modules of mechanisms that override the se¬ 
quence of agents' rules when they are activated. 
In the following sections, we detail the baseline 
model and the modular blocks of mechanisms that 
are added incrementally to the original equations. 
For further informations, the baseline model and 
the first two additional mechanisms were described 
and evaluated in details in [Cottineau et al., 2015| . 


III.l.l The Baseline Model 


Si = economicMultiplier x 
economic Multiplier > 0 (5) 

sizeEf fectOnSupply > 1 (6) 

Dj = economicMultiplier X p^’^^‘^^ff‘^‘^^OnDeniand 
sizeEf fectOnDemand > 1 (8) 


The baseline model relies on the assumption that 
population and wealth are the basic descriptors of 
cities and the engine of their coevolution. It there¬ 
fore models exclusively size effects of population 
on wealth and spatial interactions. 

At initialization, each city i of the Former Soviet 
Union is setup with its historical population P, at 
the beginning date of simulation, and located at 
its empirical coordinates, enabling site, situation 
and distance to play in the same geometry as in 
the target system. An estimated value of wealth W; 
(expressed in a fictive unit) is determined for each 
city with respect to its siz^ following equation (|^. 


_ ppopulationToWealth ^ 2 ^ 

populationToWealth > 1 (3) 

Time is modelled as discrete steps, each of 
which represents a time period of one year, during 
which interactions occur in a synchronous way. At 
each step: 


Each city interacts with other cities accord¬ 
ing to the intensity of their potential IP Q. For 
two distinct cities i and j, the computation of the 
interaction potential lP;y consists in confronting the 
supply of i pT] | to the demand of j with an equation 
borrowed to the gravity model ([l^. 


jp.. =_!_ I _ (9) 

YjfistanceDecay ^ ' 

distanceDecay > 0 (10) 

with dij a measure of distance between i and j. 

Interactions of cities i and ] based on their po¬ 
tential IPjj result in a transaction Ty from i to j 


Sij = Si X 


J±_ 

E)c ^Pik 


( 11 ) 


Dij = Di X 


JEn_ 

ZklPu 


( 12 ) 


Each city i updates its economic variables: a 
global supply S/ Q and a global demand Di 0, 
according to its population Pi and three parame- 


Tij = mm[Sij,Dji] (13) 


^This first mechanism is a first possibility of implementing the theoretical hypothesis of agglomeration economies. Wealth is 
indeed distributed superlinearly for each value of populationToWealth significantly greater than 1. 
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Each city updates its wealth W, according to 
the results of its transactions T (unsold supply LfS, 
(151 and unsatisfied demand UDj (16l) in which it 
was committed ([T^. 


n being the total number of cities in the system (i.e. 1145 
for a simulation begirming in 1959,1822 for a simulation 
beginning in 1989), and /, the number of partners with 
which i has engaged in the current simulation step. 


+ Si - D, - USi + UDi (14) 


= Wi,t + Bi 


(19) 


USi = Si-J^Tij (15) 

i 


UDi = Di-J2Tji (16) 

i 


A simulation step ends when each city up¬ 
dates its population according to its new result¬ 
ing wealth (|T7|) : 


ht = Pi,t- 


^ijWealthToPopulation ^,jWealthToPopulation 


economicMultiplier 


(17) 


III.1.2 Mechanism Increments 

The mechanism that accounts best for inter¬ 
actions benefits at the intercity level is the one 
called bonus. It 

"[...] features a non-zero sum game 
[...], rewarding cities who effectively in¬ 
teract with others rather than internally. 

We assume that the exchange of any unit 
of value is more profitable when it is 
done with another city, because of the 
potential spillovers of technology and 
information." |Cottineau et al., 2015|| 


A mechanism related to situation advantages 
is called fixed costs. It ensures that the situation of 
each city in the system is taken into account in its 
interactions with other cities. 


"Every interurban exchange gener¬ 
ates a fixed cost (the value of which 
is described by the free parameter 
fixedCost). This implies two features 
that make the model more realistic: first, 
no exchange will take place between 
two cities if the potential transacted 
value is under a certain threshold ; sec¬ 
ond, cities will select only profitable 
partners and not exchange with every 
other cities. This mechanism plays the 
role of a condition before the exchange." 
I Cottineau et al., 20151 

The interaction potential between city i and city 
i will be positive only if the potential value that i is 
willing to sell to ] is superior to the fixed value it 
costs it to negotiate, prepare and send the transac¬ 
tion ( [20| : 


IPrj = 



if Sij > fixedCost, 
otherwise. 


( 20 ) 


This bonus B; depends on the volume of trans¬ 
actions and the diversity of partners /, the city i has 
exchanged with ( [18| . It is added to the wealth at 
the end of each step ([T^, following equation ([l^: 


Therefore, each transaction of a city i gives way 
to a fixed cost. Their sum is subtracted from the 
wealth of city i at the end of each step ( |2T] |, follow¬ 
ing equation ([T4||: 


Bi = bonus Multiplier x 


ii:jTu + LiTn)xh 




(18) 


W; t = W, t — ]i X fixedCost (21) 
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Site effects are targeted by the resource mech¬ 
anism: site advantages are particularised in this 
model by natural resource deposits (more specifi¬ 
cally: coal deposits C on the one hand, and oil and 
gas deposits O on the other hand). The assump¬ 
tion is made that if the city i is located on some 
coal or oil deposits (C, = 1 or Oi = 1), the city 
benefits from the advantage granted by the extrac¬ 
tion activity The capacity of extraction depends on 
the capital (wealth) of the city and takes the form 
of a wealth multiplier for each resource ( [^ after 
equation 


y^i,t = X (1 + 


coalEffect 

oilAndGasEffect 


if Q = l, 
if Oi = l 
( 22 ) 


Territorial and political effects are for¬ 
malised by the redistribution mechanism. It al¬ 
lows for a redistribution of wealth between cities 
of the same territory R (region or State). To do so, 
territorial taxes ffj; are collected in each city /cr, as 
a proportion territorialTaxes of their wealth. The 
total amount of taxes collected is TTr^23\: 


TTr = = E territorialTaxes x W/^r (23) 

i i 

From this taxes, the administrative status of the 
territory R (denoted by CC,;r, set to 1 if i is the cap¬ 
ital city of the region and 0 otherwise) allows the 
capital city to take a share CS for its administration 
needs ( p4| : 


CSr = capitalShareOf Taxes x TTr (24) 

The rest of the taxes is redistributed to cities 
of region. Each city zr receives a share fr, r that is 
proportionate to its population ( [25| : 

Kr = {TTr - CSr) X (25) 

hk ^ k,R 

The balance of the territorial redistribution is 
added to the wealth of a city (|^ after equation 




— ^i,t — tti,R + l'>'i,R + 



if CQ,r = 1, 
otherwise 


(26) 


Finally, territorial and situation explanations 
are mixed in the urban transition mechanism. To 
account for the different opportunities of cities to 
attract rural migrants in the different regions, we 
model the evolution of the urban transition curves 
over time. As shown empirically | |Cottineau, 2014^ , 
100 out of the 108 regions of the Former Soviet 
Union have followed the scheme of the urban tran¬ 
sition. It means that their urbanisation rate Ur (in 
%) has followed a logistic function over time t ([^: 


UR,t = 


1 


100 

g—urbanisationSpeedxt 


(27) 


We have estimated the parameter 
urbanisations peed on empirical data and nor¬ 
malised the time units in order to position every 
region in a single urbanisation curve with respect 
to their time lags based on their urbanisation rate. 
The consequence of this initialisation process is 
that each region will move one step further on the 
urbanisation curve (leading eventually to 100%) at 
each simulation step, but that the rural potential to 
migrate in cities will depend on its current position 
on the urban transition curve (high potential for 
weakly urbanised regions, small potential for re¬ 
gions already very urban). The migration potential 
of each city i in territory R is built as a multiplier 
TMr specific to each region for each time step (28|: 


TMrj = 1 -|- (1 — Urj X rural Multiplier) (28) 


This extra population growth is added ( |2^ after 
and the new urbanisation rates of regions are 
updated for f -|-1 (27|: 

Pi,R,t = Pi,R,t X (1 + TMR,t) (29) 


Because we want to evaluate the contribution 
of each theoretical mechanism to the simulation of 
urbanisation and its itneraction with other mecha¬ 
nisms, we need a model that enables modules to be 
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activated and de-activated. The technical implemen¬ 
tation of MARIUS permits the modular functioning 
of the model, i.e. with or without any of its building 
blocks. 

III.1.3 Technical modular implementation 

In order to achieve a modular implementation of 
the model and its mechanisms, we leveraged the 
mixin-methods system of the Scala programming 
language [Scharli et al., 200^ . The mixins were 
first proposed at the begirming of the 90s in the Jig¬ 
saw programming language HBracha, 1992| . They 
are now adopted in mainstream languages such 
as Scala. It has been established as a powerful 
way of to perform type-safe dependency injection 
framework [ Wampler and Payne, 200^ , which is 
a powerful paradigm to achieve modularity. The 
mixin pattern allows to achieve type safe depen¬ 
dency injection. Mixins are therefore a suited tool 
to achieve modularity in model implementations, 
which we use to implement MARIUS in Scala lan¬ 
guage 

The mixin-methods concept j Steyaert et al., 19^ 
Lucas and Steyaert, 1994] generalizes object- 


object-oriented paradigm, one would implement 
subclasses of C and override the implementations 
of a and b in each subclass. For instance in the 
listing]^ the class Cl specialises C and defines im¬ 
plementations for the methods a and b. 

Listing 1: Object oriented specialisation 


1 

abstract class C extends A with B { 

2 

def 

a(x: Double) : 

Double 

3 

def 

b(x: Double) : 

Double 

4 

def 

c(x: Double) = 

/* Compute something 



using a and b 

*/ 

5 

> 



6 




7 

class 

Cl extends C { 


8 

def 

a(x: Double) = 

/* Some 



implementation 

of a */ 

9 

def 

b(x: Double) = 

/* Some 



implementation 

of b * / 

10 

1 




This pattern achieves 

a very low level of code 


reusability. Let a and b have 10 possible implemen¬ 
tations, then 100 specialised implementations of C 
would be required. The mixins method solves the 
problems of combinatorial explosion of the number 
of implementations by delaying the entanglement 
of the class components at the instantiation site. 
Listing 1^ exposes an implementation based on mix¬ 
ins providing alternative implementations of A and 
B and the corresponding parameter specifications. 
The implementation choice is delayed until the in¬ 
stantiation point (last lines of listing at which a 
mixin is defined. 


Listing 2: Mixin in Scala 


oriented programming to enable feature-oriented 
programming [ Prehofer, 1997|j . It makes the 
definition of class hierarchy more flexible 
[Steyaert and Meuter, 1995 j. Variations of isolated 
features (such as the different update functions 
of city wealth and population in our model) are 
defined in separate modules and mixed-in with 
each other at the object instantiation point, also 
called mixin application. The advantage of mixins (or 
trait) over classical object-oriented programming is 
the possibility of defining numerous variations of 
several features without increasing exponentially 
the number of specialized class. 

For instance, let's consider a class C implement¬ 
ing two methods a and t|^ To define alternative 
implementations of those methods in the classical 

® The source code of the model is available under open-licensing here: https: //github. com/ISCPIF/simpuzzle 

® We could think of class C as the interaction potential function, a representing the basic implementation of equation and b 
the fixed cost selection of equation [^ 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


trait 

def 

1 

trait 

def 

} 

trait 

def 


A { 
a (x : 


B { 
b (x : 


Double): Double 


Double): Double 


C extends A with B { 

c(x: Double) = /* Compute something 
using a and b */ 
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11 } 

12 

13 // Implementation 1 of trait A 

14 trait A1 extends A { 

15 def a(x: Double) = /* Some 

implementation */ 

16 } 

17 

18 // Implementation 2 of trait A 

19 trait A2 extends A { 

20 // Parameter pO used in this version of 

a 

21 def pO: Double 

22 def a(x: Double) = /* Some 

implementation using pO */ 

23 } 

24 

25 // Implementation 1 of trait B 

26 trait B1 extends B { 

27 def b(x: Double) = /* Some 

implementation */ 

28 } 

29 

30 // Implementation 2 of trait B 

31 trait B2 extends B { 

32 def b(x: Double) = /* Some 

implementation */ 

33 } 

34 

35 val instancel = 

36 new C with A1 with B2 {I 

37 

38 val instance2 = 

39 new C with A2 with B2 f 

40 // Value for parameter pO of trait A2 

41 def pO = 1.0 

42 } 


Scala traits expressiveness can be leveraged to 
implement modular evolutive type-safe modelling 
frameworks proposing alternative model features, 
feature composability and the formalisation of the 
feature dependencies. The implementation of al¬ 
ternative behaviours in several traits provides the 
isolation of model component implementations and 
explicit dependencies between these components. 
Each component defines free parameters that have 
to be set at the model instantiation site, otherwise 
it won't compile. 

In this experiment we defined each alternative 
model mechanism in a particular trait. The exe¬ 


cutable model has then be composed by picking the 
traits we wanted to test. In order to evaluate concur¬ 
rently all the alternative mechanisms, we generated 
all the possible models (or combination of mech¬ 
anisms). It was achieved using a code generation 
algorithm which produces all the possible models 
implementations by generating a Scala source code 
containing all the possible traits combinations, such 
as the one shown on Listing 

Listing 3: Example of generated code 

1 

2 def model(index: Int, parameters: Seq[ 

Double]) = 

3 index match { 

4 case 0 => 

5 new Model with Til with T21 with 

. . . f 

6 def pO = parameters (0) 

7 def pi = parameters ( 1) 

8 

9 > 

10 case 1 => 

11 new Model with Til with T22 with 

. . . { 

12 def pO = parameters(0) 

13 def pi = parameters(1) 

14 

15 > 

16 case 2 => ... 

17 } 


This generated source code implements a single 
function encapsulating all the model alternatives. 
This function takes two arguments: an index of 
the implementation that shall be executed and a 
vector of parameters to set for the model (for this 
work we calibrated only double precision floating 
points values). Note that the vector has a fixed size 
which does not depends on the model instantiated. 
A given model implementation generally does not 
use all the parameters: instead it will use only some 
of them and ignore the others. 

III.2. Calibrating a multi-model 

"Whatever changes occur in the institu¬ 
tional, political and social context of com¬ 
putational models, the question of how to 
learn from models remains. It is clear that 
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assessmenf of fhe accuracy of a model as a 
representation must rest on argument about 
how competing theories are represented in 
its workings, with calibration and fitting 
procedures acting as a check on reasoning.” 
nO'Sullivan, 2004| p.291] 

In order to evaluate the capacity of the models 
to reproduce the historical trajectory of urbanisa¬ 
tion in the Former Soviet Union, we rely on an 
automated calibration. This procedure is part of 


the model evaluation | Grimm and Railsback, 2012 


Thiele et al., 2014||Schmitt et al., 201^ . Its aim is to 
find the values of parameters for which the model 
results match the fitness criteria (here: 5, the lowest 
possible distance to the data, the realistic criteria 
having been met). If the model can be calibrated, 
then it is shown that the mechanisms included 
in the model are sufficient to simulate the urban 
trajectory (not necessary though). If there are no 
parameter value for which the fitness criteria is 
met, then the combination of mechanisms is not 
sufficient to model urbanisation under the current 
implementation. 


In order to calibrate all the models at once, 
we designed a variant of the NSGAII genetic 
algorithm that includes a niching mechanism 
[Mahfoud, 1995| . Niching methods aim at pre¬ 
serving suboptimal solutions to preserve diversity. 
Our niching algorithm divided the population into 
sub-populations, each sub-population containing 
one model alternative. The genome of each individ¬ 
ual^ contains two parts. The first part is an integer 
value that corresponds to the index of the model 
alternative on which the genome is evaluated. The 
second part is a vector of double values containing 
the values of all the parameters for the model. 


In order to evaluate a genome, we designed 
a fitness function. This function calls the gener¬ 
ated function described above, runs the model and 
evaluates its dynamics using the fitness function 
described in section 111.31 (i.e. the criteria of realism 


of micro-dynamics and the distance between simu¬ 
lated and observed population data for each city at 
each census date). 


In NGSAII, the elitism operation preserves the 
best individuals among the whole population. The 
evaluation algorithm we applied has the exact same 
elitism strategy for each sub-population. No global 
elitism strategy was performed, instead we kept 
the 50 best-performing individuals in each sub¬ 
population (or model combination of mechanisms). 
In order to speed up the convergence of the algo¬ 
rithm, we also tweaked the mutation operation: it 
had a 10% chance of mutating the "model index" 
part of the genome. The new value for the "model 
index" was drawn uniformly among the possible 
model indexes. This allowed to periodically test 
on other models, some parameter values that were 
performing well on a given model. 


We then distributed this algorithm on the Eu¬ 
ropean grid EGI using the technique known as the 
"island model" in the same way as we described 
in pSchmitt et al., 201^ . We ran 200 000 jobs of 2 
hours. 


III.3. Hypothesis testing to explain urbanisa¬ 
tion in the Former Soviet Union 

After the multicalibration procedure, we end up 
with the best performing simulations of 64 different 
structures of model^ The analysis of calibra¬ 
tion results consists in relating the performance 
of these calibrated models (in terms of distance 
between simulated and observed growth hier¬ 
archically and spatially) to their structure and 
the values of parameters associated with each 
activated mechanism. This analysis was made 
using an interactive application available online: 
http://shiny.parisgeo.cnrs.fr/VARIUS, 

We present the results of this exploration in the 
form of three questions at the macro, meso and 


^An individual correspond to a vector of parameter values and structure index that defines the model under evaluation 
®For example: baseline model, baseline model + fixed cost, baseline model + urban transition + resources, baseline model + 
redistribution + urban transition, baseline model + fixed cost + bonus + resources, etc. 
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micro scale of the city-system. 

1/ Which is the most parsimonious model to 
simulate the evolution of cities before and after 
the collapse of the Soviet Union? For simula¬ 
tions starting in 1959 up to 1985|^ the best model 
with only one additional mechanism is composed 
of the baseline model plus the mechanism of urban 
transition (cf. Table [^. 

It is characterised by significant economies of 
agglomeration {sizeEffectOnSupply = 1.05) but a 
linear function of demand with size. The rural mul¬ 
tiplier is equal to 3.5% and allows to simulate fast 
urbanising regions of Siberia and Central Asia (cf. 
figure [^. However, the population of a majority 
of cities is under-estimated in the simulation, es¬ 
pecially in the upper part of the hierarchy, around 
Moscow and in eastern UkrainJ^ 


Table 1: Best simple model before the transition 


Parameter name 

Value 

Mechanism 

economicMultiplier 

0.002193758 

Generic 

populationToWealth 

1.000184755 

Generic 

sizeEffectOnSupply 

1.053943022 

Generic 

sizeEffectOnDemand 

1.000000000 

Generic 

wealthToPopulation 

0.203567639 

Generic 

distanceDecay 

1.872702086 

Generic 

ruralMultiplier 

0.034975771 

U rbanTransition 

Normalized <5 

n cities 

Time steps 

0.01423387 

1145 

30 


After the transition, the best performing model 
with one additional mechanism includes site ad¬ 
vantages for cities. This model has two additional 
parameters and meets the evaluation criteria (per 
city per census) twice better than the best model 
for the previous period (0.005 vs. 0.01, cf. tables 
and|^. 

The analysis of the parameters fits the empirical 
observations of faster growing cities located on oil 
and gas deposits (oilAndGasEffect = 0.02), and 


declining cities in the Donbass and Kuzbass coal 
regions {coalEffect = —0.01). This model's speci¬ 
fications include very low size effects and a very 
uneven wealth distribution amongst cities at initial¬ 
isation {populationToWealth = 1.12). The residuals 
are distributed roughly symmetrically (figure [^, 
but without any mechanism of urban transition, the 
model clearly underestimates the post-1989 growth 
of all the rapidly growing cities of Central Asia. 


Table 2: Best simple model after the transition 


Parameter name 

Value 

Mechanism 

economicMultiplier 

0.502616330 

Generic 

populationToWealth 

1.124963276 

Generic 

sizeEffectOnSupply 

1.002982515 

Generic 

SizeEffectOnDemand 

1.000808442 

Generic 

wealthToPopulation 

0.699943763 

Generic 

distanceDecay 

1.475836151 

Generic 

oilAndGazEffect 

0.017066495 

Resources 

coalEffect 

-0.011792670 

Resources 

Normalized S 

n cities 

Time steps 

0.005180008 

1822 

21 


To summarise, situation effects and territorial 
effects seem to be the dominant candidates for ex¬ 
plaining the specific part of the trajectories of cities 
in the Soviet era, while site effects seem to have 
taken over since 1991, the transition to capitalism 
and the rise of oil prices in the world markets. More¬ 
over, the better fit of the latter model could be an 
indication of the "normalization" of the urban pro¬ 
cesses in the post-Soviet space, compared to a more 
singular pattern of Soviet urbanisation. 

2/ Which are the mechanisms (and mecha¬ 
nisms' interactions) that are essential to model 
the Soviet and post-Soviet urbanisation patterns? 

We statistically analyse the results of the multicali¬ 
bration to evaluate the contribution of each mech¬ 
anism (everything else being equal in the model 
structure) to the reduction of distance between sim¬ 
ulated and observed demographic data for each 
city. 


®The Soviet Union actually came to an end in Decembre 1991. In this study, we rely on historical data and therefore consider 
the last Census date of the Soviet Union in 1989 as the transition point between the two economic, political and territorial regimes, 
^'^or more generally in the Western part of the Former Soviet Union. See question 3 for an hypothesis as to why that might be. 
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Figure 1: Residuals of the parsimonious models 

Urban Transition model. Simulation 1959-1989 I Resource Extraction model. Simulation 1989-2010 
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Figure 2: Contribution of mechanisms to a simulation that 
reduces the distance to observed data 


ft Ainn - 



Mechanism 


N.B.: Esfimafed coefficienfs are considered significanf 
for p-values inferior fo 0.005 


We find that on average, the bonus, fixed costs 
and urban transition mechanisms tend to reduce 
the simulation error significantly (cf. figure |^. The 
transition and bonus mechanisms are identified 
to be the most effective ones. By contrast, the re¬ 
source mechanism is correlated significantly with 
a change in the evaluation criteria, but tends to 
increase the error when it is activated (compared 
with model structures without this mechanism). 
This counter-intuitive result might be linked to 
the weak influence of resource extraction for the 
first period of simulation, when there is a diversity 
of urban trajectories within resource-rich regions. 
The redistribution mechanisms does not appear 
significant on average in this analysis, as it plays a 
minor role in the reduction of error when activated. 

Finally, we confirm the observation that our 
models work better to simulate the urban evolution 
following the dismantlement of the Soviet Union. 
Indeed, as shown in figure the value of normal¬ 
ized S is greater by 0.01 point on average when the 
model is specified for the first period. This could be 
an expression of "normalisation" or simplification 
of the urban processes in the post-Soviet space after 
the political and economic transition of the 1990s. 


3/ What are the cities that resist modelling? In 
other words, what are the cities that are too spe¬ 
cific to be modelled by any of the mechanisms 
implemented? To answer this last question, we 
statistically analyse the difference between the log 
of the population observed and the log of the popu¬ 
lation simulated at the last evaluation Census date 
for cities included in the simulation, with respect 
to their locational and functional attributes. The 
models for which we present the results below 
contains all the implemented mechanisms, and are 
applied to the two periods of enquiry. 

For the two periods, we find that our models 
persistently and significantly under-estimate the 
growth of the largest and most western cities of 
the (Former) Soviet Union, everything else being 
equal (cf. figure]^. Moreover, capital cities appear 
to have grown less historically than what we can 
predict with a complete model of the period 1959- 
1989. The other urban attributes included in the 
regressions (natural resources and mono-functional 
specialisation) do not seem associated with any 
systematic pattern of over- or under-estimation. 

The difficulty to reproduce the trajectory of 
the largest cities has been encountered for a com¬ 
parable model of system of cities (Simpop2, see 
[ [Bretagnolle and Pumain, 2010| ) and solved by the 
exogenous introduction of innovations to account 
for the creative features and higher probability of 
adoption of new technology and functions by the 
largest cities. 

The under-estimation of growth in the western 
part of the territory might be due to its integration 
within a larger area (the Eastern Europe) during 
the periods under study: our hypothesis here is 
that the centrality of western (post-)Soviet cities 
would then be minored in our model because it 
does not take into account the interactions with 
east-european cities (Warsaw, Prague, Bratislava, 
etc.) which formed altogether an economic system 
(even though the integration was always stronger 
within the FSU). 
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Figure 3: Profiles of residuals 

Complete model. Simulation 1959-1989 I Complete model. Simulation 1989-2010 



N.B.: Estimated coefficients are considered significant for p-values inferior fo 0.01 


Table 3: Observed and Simulated populations of urban out- Table 4: Observed and Simulated populations of urban out¬ 
liers in 1989 liers in 2010 


Positive Residuals _ _ Positive Residuals 


City 

Observed 

Simulated 


City 

Observed 

Simulated 

Naberezhnye Tchelny 

500 000 

30 000 


Mirn5rja 

41 000 

12 000 

Volgodonsk 

191 000 

36 000 


Sertolovo 

48 000 

16 000 

Chajkovskij 

86 000 

19 000 


Beineu 

32 000 

11 000 

Toljatti 

685 000 

158 000 


Govurdak 

76 000 

28 000 

Bratsk 

285 000 

73 000 


Serdar / Gyzylarbat 

98 000 

37 000 

Balakovo 

197 000 

52 000 


Bayramaly 

131 000 

53 000 

Tihvin 

71 000 

20 000 


Sarov 

92 000 

39 000 

Chervonograd 

72 000 

21 000 


Turkmenabat / Tchardjou 

427 000 

185 000 

Obninsk 

111 000 

32 000 


Astana / tselinograd 

613 000 

278 000 

Staryjoskol 

174 000 

53 000 


Dashougouz 

275 000 

126 000 


Negative Residuals Negative Residuals 


City 

Observed 

Simulated 


City 

Observed 

Simulated 

Zaozern)^] 

16 000 

54 000 


Sovetabad 

11 000 

33 000 

Gremjachnsk 

21 000 

56 000 


Zhanatas 

21 000 

50 000 

Atakent / Hitch 

15 000 

38 000 


Krasnozavodsk 

13 000 

31 000 

Kizel 

37 000 

88 000 


Gagra 

11 000 

25 000 

Cheremhovo 

74 000 

172 000 


Nevelsk 

12 000 

26 000 

Ilanskij 

18 000 

42 000 


Arkalyk 

28 000 

59 000 

Gornoaltajsk 

46 000 

102 000 


Ghyatura 

14 000 

28 000 

Volchansk 

15 000 

32 000 


Aleksandrovsk Sahalinsk 

11 000 

21 000 

Zujevka 

16 000 

35 000 


Uglegorsk 

10 000 

20 000 

Taldykorgai 

138 000 

296 000 


Baikonyr 

36 000 

67 000 
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Finally, some individual cities appear as clear 
outlyers of the model, and could correspond to a 
profile that is too specific to be modelled by any 
generic mechanism. For the first period, (cf. table 
1^, the most obvious examples of singular trajec¬ 
tories are the cities which grew much faster than 
what was expected from their site, situation or in¬ 
teraction attributes. Indeed, Naberezhnye Tchelny, 
Volgodonsk, Toljatti or Bratsk owe their sudden 
development to political decisions to implement 
flagship projects: automobile industry mega-plants 
in Naberezhnye Tchelny (trucks) and Toljatti (cars), 
energy production sites in Volgodonsk (atomic 
power) and Bratsk (hydroelectric power station). 
These economic policies of the 1950s and 1960s led 
those cities to be four times as populated 30 years 
lates than what was expected from their interac¬ 
tions, resource or regional characteristics. 


For the second period, (cf. table |^, Astana is 
a good example of a similar singular trajectory 
that we would not aim to simulate with generic 
urbanisation mechanisms, as it owes its boom¬ 
ing growth to the decision of the Kazakh newly 
independent State to locate its headquarters in 
this city more central to the country (compared 
to Almaty). On the contrary, Baikonyr, also in 
Kazakhstan, has suffered from the cuts in the 
space industry (non-predictable at the urban level 
of our mechanisms). Other shrinking cities like 
Aleksandrovsk-Sahalinsk, Krasnozavodsk or Ugle- 
gorsk would require more detailed mechanisms of 
demographics (lack of birth and emigration) and 
economic cycles to be simulated adequately. 


To summarise, there are particular types of ur¬ 
ban trajectories that are not simulated well by the 
model because of its simplicity, and trajectories that 
are too specific to be modelled. We find that the 
exploration of our models, their calibration and the 
analysis of residuals has helped to identify those 
cities and to suggest some missing mechanisms. 


IV. Conclusion 

"Despite the fact that the experience of in¬ 
dividual cities has become more varied in¬ 
ternationally (at least within what might 
be called the mature economies) there is 
stronger evidence of a predictable pattern of 
change, determined by common causal fac¬ 
tors, than might be expected given the diver¬ 
sity and variety of cities." [Cheshire, 1999| 
p. 1342] 

Systems of cities have attracted a lot of attention 
from social modellers because of the regularity of 
their patterns. Instead of regarding the profusion 
of competing theories as a source of confusion (or 
suspicion), we proposed a framework to integrate 
complementary accounts of urbanisation processes 
into a modular agent-based model. This work of 
synthesis and testing within a virtual laboratory 
has been made possible by its automation and the 
extensive use of computation resources to calibra¬ 
tion sixty-four models structures with empirical 
data on almost two thousands cities in the Former 
Soviet Union over the last fifty years. The model 
provides a basis for comparison of the different the¬ 
ories that can be augmented by new or alternative 
implementations of mechanisms. It could also be 
applied to different systems of cities (in space or 
time). 

In the present study, we showed that the multi¬ 
modelling approach has helped identify and order 
the mechanisms that most probably generated the 
urban pattern or Soviet and post-Soviet urbanisa¬ 
tion: situation effects of urban transition before 
1991 and resource extraction afterwards. The in¬ 
tuitive mechanism of redistribution however has 
proved unsignificant. This method was finally use¬ 
ful to spot the most singular trajectories of cities 
that we interpret with empirical and monographic 
knowledge to refine the model. 

To conclude, modelling experiments perform 
a radical compression of reality and an extreme 
simplification of individual trajectories, events and 
persons into synthetic aggregates. The ontological 
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adequacy of the model to real life is therefore neces¬ 
sarily evaluated at an aggregated level that creates 
the problem of equifinality. No simulation model 
can elude this question, but everything should be 
tried to reduce its impact on what can be learned 
from the modelling experience. 
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